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3 <110> APPLICANT: Sagami Chemical Research Center, Protegene Inc. 

5 <120> TITLE OF INVENTION: Human proteins having hydrophobic domains and DNAs encoding 
these proteins 

7 <130> FILE REFERENCE: 661924 r\ a . A I 

9 <140> CURRENT APPLICATION NUMBER: US/10/019, 151A U U Qa^AA (i/^UJ^ 

10 <141> CURRENT FILING DATE: 2003-01-31 |f j\J"^U >7 ^ 

12 <150> PRIOR APPLICATION NUMBER: JP 11-178065 J/L -V A&jj^Ji 

13 <151> PRIOR FILING DATE: 1999-06-24 X^y^y/^ * 

15 <160> NUMBER OF SEQ ID NOS : 24 fi^S^^ S^£* 



ERRORED SEQUENCES GOfTBCf BdQfcf^^Pty fiJ^A) 

17 <210> SEQ ID NO: 1 jNTlL^^ i ^ Sfl °fep Am*****^ 

18 <211> LENGTH: 238 41 (f ' 0 

19 <212> TYPE: PRT * fl AxJL. 

20 <213> ORGANISM: Homo sapiens ^X^^' ' 
22 <4 00> SEQUENCE: 1 
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S — > 



<213> ORGANISM: Homo sapiens I Jt^^" 0 ^ 1 ^ I W 

<4 00> SEQUENCE: 7 ^ ^ /ujsn****^ -*~ 



100 Ser Asn Ser Asp Thr Lys Thr Arg Thr Ala Ser Gly Tyr Gly Gly Thr 
E— > 101 325 ' 330 336 

102 Arg Arg Arg 

255 <210> SEQ ID NO: 7 

256 <211> LENGTH: 200 

257 <212> TYPE: PRT 
258 

260 . 

261 Met Ala Ala Ser Met His Gly Gin Pro Ser Pro Ser Leu G@u Asp Ala 

262 15 10 15 

263 Lys Leu Arg Arg Pro Met Val lie Glu lie lie Glu Lys Asn Phe Asp 

264 20 25 30 

265 Tyr Leu Arg Lys Glu Met Thr Gin Asn lie Tyr Gin Met Ala Thr Phe 

266 ' 35 40 ~ 45 

267 Gly Thr Thr Ala Gly Phe Ser Gly lie Phe Ser Asn Phe Leu Phe Arg 

268 50 55 60 

269 Arg Cys Phe Lys Val Lys His Asp Ala Leu Lys Thr Tyr Ala Ser Leu 

270 65 70 75 80 

271 Ala Thr Leu Pro Phe Leu Ser Thr Val Val Thr Asp Lys Leu Phe Val 

272 85 90 " 95 

273 lie Asp Ala Leu Tyr Ser Asp Asn lie Ser Lys Glu Asn Cys Val Phe 

274 100 105 ( 110 

275 Arg Ser Ser Leu lie Gly lie Val Cys' Gly Val Phe Tyr Pro Ser Ser 

276 115 120 125 

277 Leu Ala Phe Thr Lys Asn Gly Arg Leu Ala Thr Lys Tyr His Thr Val 

278 130 135 140 

279 Pro Leu Pro Pro Lys Gly Arg Val Leu lie His Trp Met Thr Leu Cys 

280 145 150 155 160 

281 Gin Thr Gin Met Lys Leu Met Ala He Pro Leu Val Phe Gin He Met 

282 165 170 175 

283 Phe Gly He Leu Asn Gly Leu Tyr His Tyr Ala Val Phe Glu Glu Thr 

284 180 " " 185 ~ 190 

285 Leu Glu Lys Thr He His Glu Glu 

286 195 200 

386 <210> SEQ ID NO: 12 

387 <211> LENGTH: 972 

388 <212> TYPE: DNA 
38 9 <213> ORGANISM: Homo sapiens 

391 <400> SEQUENCE: 12 

392 atggcggccg cggctccggg gaacggccgc gcatcggcgc cccggctgct tctgctcttt 60 

393 ctggttccgc tgctgtgggc cccggctgcg gtccgggccg gcccagatga agaccttagc 120 

394 caccggaaca aagaaccgcc ggcgccggcc cagcagctgc agccgcagcc tgtggctgtg 180 

395 cagggccccg agccggcccg ggtcgagaaa atatttacac cagcagctcc agttcatacc 240 

396 aataaagaag atcctgctac ccaaactaat ttgggattta tccatgcatt tgtcgctgcc 300 

397 atatcagtta ttattgtatc tgaattgggt gataagacat tttttatagc agccatcatg 360 

398 gcaatgcgct ataaccgcct gaccgtgctg gctggtgcaa tgcttgcctt gggactaatg 420 

399 acatgcttgt cagttttgtt- tggctatgcc accacagtca tccccagggt ctatacatac 480 

400 tatgtttcaa ctgtattatt tgccattttt ggcattagaa tgcttcggga aggcttaaag 540 

401 atgagccctg atgagggtca agaggaactg gaagaagttc aagctgaatt aaagaagaaa 600 



4 
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6 



W— > 402 

gatgaagaat ttcaacgaac caaactttta aafcggaccgg gagatgttga aacgggtaca 660^gcataacag tacctcagaa aa 
E — > 403 ^h&ty ****** 

a ctgtggggca ctgcctgtgc 840 acgggattgg cagtaattgg aggaagaatg atagcacaga aaatctctgt cagaactgtg 
E— > 404 gattctggtt tt 972 

792 <210> SEQ ID NO: 21 

793 <211> LENGTH: 2064 

794 <212> TYPE: DNA 

795 <213> ORGANISM: Homo sapiens 

797 <220> FEATURE: 

798 <221> NAME/KEY: CDS 

799 <222> LOCATION: (98)... (559) 
. 801 <400> SEQUENCE: 21 

fEJkA> 802 aaaacagctg ctggagcagc agcggccccc gctcccggga accgttcccg ggccgttgat 60 

803 cttcggcccc acacgaacag cagagagggg cagcagg atg aat gtg ggc aca 112 

804 Met Asn Val Gly Thr 

805 1 5 

806 gcg cac age gag gtg aac ccc aac acg egg gtg atg aac age cgt ggc 160 

807 Ala His Ser Glu Val Asn Pro Asn Thr Arg Val Met Asn Ser Arg Gly 

808 10 15 20 

809 ate tgg etc tec tac gtg ctg gee ate ggt etc etc cac ate gtg ctg 208 

810 lie Trp Leu Ser Tyr Val Leu Ala lie Gly Leu Leu His lie Val Leu 

811 25 30 35 

812 ctg age ate ccg ttt gtg agt gtc cct gtc gtc tgg acc etc ace aac 256 

813 Leu Ser lie Pro Phe Val Ser Val Pro Val Val Trp Thr Leu Thr Asn 

814 40 45 50 

815 etc att cac aac atg ggc atg tat ate ttc ctg cac acg gtg aag ggg 304 

816 Leu He His Asn Met Gly Met Tyr He Phe Leu His Thr Val Lys Gly 

817 55 60 65 

818 aca ccc ttt gag acc ccg gac cag ggc aag gcg agg ctg eta acc cac 352 

819 Thr Pro Phe Glu Thr Pro Asp Gin Gly Lys Ala Arg Leu Leu Thr His 

820 70 75 80 85 

821 tgg gag cag atg gat tat ggg gtc cag ttc acg gec tct egg aag ttc 400 

822 Trp Glu Gin Met Asp Tyr Gly Val Gin Phe Thr Ala Ser Arg Lys Phe 

823 90 95 100 

824 ttg acc ate aca ccc ate gtg ctg tac ttc etc acc age ttc tac act 448 

825 Leu Thr He Thr Pro He Val Leu Tyr Phe Leu Thr Ser Phe Tyr Thr 

826 105 110 115 

827 aag tac gac cag ate cat ttt gtg etc aac acc gtg tec ctg atg age 496 

828 Lys Tyr Asp Gin He His Phe Val Leu Asn Thr Val Ser Leu Met Ser 

829 120 125 . 130 

830 gtg ctt ate ccc aag ctg ccc cag etc cac gga gtc egg att ttt gga 544 

831 Val Leu lie Pro Lys Leu Pro Gin Leu His Gly Val Arg He Phe Gly 

832 135 140 145 

833 ate aat aag tac tgagagtgca gccccttccc ctgcccaggg tggcagggga gggg 600 

834 He Asn Lys Tyr 

835 150 

836 tagggtaaaa ggcatgtgct gcaacactga agacagaaag aagaagcetc tggacactgc 660 

837 cagagatggg ggttgagcct ctggcctaat ttcccccctc gcttccccca gtagecaact 720 

838 tggagtagct tgtagtgggg ttggggtagg ccccctgggc tctgaccttt tctgaatttt 780 

839 ttgatctttt ecttttgett tttgaataga gactccatgg agttggtcat ggaatgggct 840 
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840 gggctcctgg gctgaacatg gaccacgcag ttgcgacagg aggccagggg aaaaacccct 900 

841 gctcacttgt ttgccctcag gcagccaaag cactttaacc cctgcatagg gagcagaggg, 960 .1 / J 

w— > 842 J^ivxS^tt hard rdtvm 

cggtacggct tctggattgt ttcactgtga ttcctaggtt ttttcgatgc cacgcagtgt 1020*rjtgcttttgt gtatggaagc aa 
W— > 843 / 

g tagatggctt ggttccagag 1200^$tgagggggc cagggctgct gccatcctgg cctggtggag gttggggagc tgtaggagag 

E — > 844 caggcttgga gggttaattt acccacagaa tgtgatagta ataggggagg gaggctgctg 1440 

E — > 845 tgggtttaac tcctgggttg gctgttgggt agacaggtgg ggaaaaggcc cgtgagtcat 1500 

E — > 846 tgtaagcaca ggtccaactt ggccctgact cctgcggggg tatggggaag ctgtgacaga 1560 

E — > 847 aacgatgggt gctgtggtcc tctgcaggcc ctcacccctt aacttcctca tacagactgg 1620 

E — > 848 cactgggcag ggcctctcat gtggcagcca catgtggcgt tgtgaggcca ccccatgtgg 1680 

E — > 849 ggtctgtggt gagagtcctg taggatccct gctcaagcag cacagaggaa ggggcaagac 1740 

E — > 850 gtggcctgta ggcactgttt cagcctgcag agaagaaagt gaggccggga gcctgagcct 1800 

E — > 851 gggctggagc cttctcccct ccccagttgg actaggggca gtgttaattt tgaaaaggtg 1860 

E — > 852 tgggtccctg tgtcctcttc caggggtcca agggaacagg agaggtcact gggcctgttt 1920 

E — > 853 tctccctcct gaccctgcat ctcccacccc gtgtatcata gggaactttc accttaaaat 1980 

E — > 854 ctttctaagc aaagtgtgaa taggattttt actccctttg tacagtattc tgagaaacgc 2040 

E — > 855 aaataaaagg gcaacatgtt tctg 2064 



& 



902 <210> SEQ ID NO: 23 

903 <211> LENGTH: 1161 

904 <212> TYPE: DNA 

905 <213> ORGANISM: Homo sapiens 

907 <220> FEATURE: 

908 <221> NAME/KEY: CDS 

909 <222> LOCATION: (278)... (880) 
911 <400> SEQUENCE: 23 



~> 912 acatgagcca ccaaaatggt ggtgttcggg tatgaggctg ggactaagcc aagggattca 60 

913 ggtgtggtgc cggtgggaac tgaggaagcg cccaaggaaa tgaaacacga tttccaaaat 120 

914 gaacttaatc tttcatgaga aactgaggat agagatgtca ataagcagcc actgtttcca 180 

915 cctccccacc tgaagagcta ggaggacaac tacaaagagc ctgactgcct tctcggaatg 24 0 

916 aggagagagg aaaacagcaa cagtatcagt tttcaag atg gca gca tct atg 292 

917 Met Ala Ala Ser Met 

918 1 5 

919 cat ggt cag ccc agt cct tct eta gaa gat gca aaa etc aga aga cca 340 

920 His Gly Gin Pro Ser Pro Ser Leu Glu Asp Ala Lys Leu Arg Arg Pro 

921 10 • 15 20 

922 atg gtc ata gaa ate ata gaa aaa aat ttt gac tat ctt aga aaa gaa 388 

923 Met Val lie Glu lie lie Glu Lys Asn Phe Asp Tyr Leu Arg Lys Glu 

924 25 30 35 

925 atg aca caa aat ata tat caa atg gcg aca ttt gga aca aca get ggt 436 

926 Met Thr Gin Asn He Tyr Gin Met Ala Thr Phe -Gly Thr Thr Ala Gly 

927 40 45 50 

928 ttc tct gga ata ttc tea aac ttc ctg ttc aga cgc tgc ttc aag gtt 484 

929 Phe Ser Gly He Phe Ser Asn Phe Leu Phe Arg Arg Cys Phe Lys Val 

930 55 60 65 

931 aaa cat gat get ttg aag aca tat gca tea ttg get aca ctt cca ttt 532 

932 Lys His Asp Ala Leu Lys Thr Tyr Ala Ser Leu Ala Thr Leu Pro Phe 

933 70 75 80 85 

934 ttg tct act gtt gtt act gac aag ctt ttt gta att gat get ttg tat 580 

935 Leu Ser Thr Val Val Thr Asp Lys Leu Phe Val He Asp Ala Leu Tyr 

936 90 95 100 
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nla 


Pho 

xrne 


i 1 IX 


Lys 




Q4 9 

y ft c. 
















1 9 S 










J. ~> w 










943 


aat 


gga 


cgc 


ctg 


gca 


ace 


aag 


tat 


cat 


acc 


gtt 


cca 


ctg 


cca 


cca 


aaa 


724 


944 


Asn 


Gly 


Arg 


Leu 


Ala 
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Lys 


Tyr 


His 


Thr 


Val 


Pro 


Leu 


Pro 


Pro 


Lys 




945 




135 










140 










145 












946 


gga 


agg 


gtt 


tta 


ate 


cat 


tgg 


atg 


acg 


ctt 


tgt 


caa 


aca 


caa 


atg 


aaa 


772 


947 


Gly 


Arg 


Val 


Leu 


He 


His 


Trp 


Met 


Thr 


Leu 


Cys 


Gin 


Thr 


Gin 


Met 


Lys 




948 


150 










155 










160 










165 




949 


tta 


atg 


gcg 


att 


cct 


eta 


gtc 


ttt 


cag 


att 


atg 


ttt 


gga 


ata 


tta 


aat 


820 


950 


Leu 


Met 


Ala 


He 


Pro 


Leu 


Val 


Phe 


Gin 


He 


Met 


Phe 


Gly 


He 


Leu 


Asn 




951 










170 










175 










180 






952 


ggt 


eta 


tac 


cat 


tat 


gca 


gta 


ttt 


gaa 


gag 


aca 


ctt 


gag 


aaa 


act 


ata 


868 


953 


Gly 


Leu 


Tyr 


His 


Tyr 


Ala 


Val 


Phe 


Glu 


Glu 


Thr 


Leu 


Glu 


Lys 


Thr 


He 





954 185 190 . 195 

955 cat gaa gag taaccaaaaa aatgaatggt tgctaactta gcaaaatgaa gtt 920 

956 His Glu Glu 

957 200 

958 tctataaaga ggactcaggc attgetgaaa gagttaaaag taactgtgaa caaataattt 980 
W— > 959 

gttctgtgcc ttttgcctgg tatatagcaa atactcaaaa agtattcaat aattcaatca 104(Tataaatataa gtttcatctt ac 
E— > 960 t » 1161 
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/o/ol%/SlA 



<210> 17 

<211> 1167 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (187) . . . (903) 
<400> 17 

cttgcctctg ggaaggaaat acattataga gtgggaaatt tttatcattt tgaaccaaga 
ttcttctaaa gaaagaaaga ctgattaata aaatgtggca gctgtgctct tcaaggcatt 
tatagtgtat atagttttag aaaaacagtc ccaccactta agcatagatg taatttacta 
ataaaa atg att ctg ctt gtg att ctt gca ttt tat ctg tgg cag gtg 
Met lie Leu Leu Val lie Leu Ala Phe Tyr Leu Trp Gin Val 
1 5 10 

gac atg ttg agt gaa att aac att get ccc egg att etc ace aat ttc 
Asp Met Leu Ser Glu lie Asn lie Ala Pro Arg lie Leu Thr Asn Phe 
15 20 ' 25 * • 30 

act gga gta atg cca cct cag ttc aaa aag gat ttg gat tec. tat ctt 
Thr Gly Val Met Pro Pro Gin Phe Lys Lys Asp Leu Asp Ser Tyr Leu 
/vu^a>^^ ig &r 40 45 

aaa act cga tea cca gtc act ttc ctg tct gat ctg cgc age aac eta 
Lys Thr Arg Ser Pro Val Thr Phe Leu Ser Asp Leu Arg Ser Asn Leu 
~ 50 '55 '60 



60 
120 
180 
228 



276 



324 



372 



/o/qI%I$(\ 



tgt gaa ggc tat gag tec tct gaa gac cag tat gta eta aga ggt tct 
Cys Glu Gly Tyr GJ_.u Ser Ser Glu Asp Gin Tyr Val Leu Arg Gly Ser 
115 * X&T HO 125 



501 



RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/019 , 151A 



DATE: 02/25/2004 
TIME: 08:52:44 



Input Set : A:\01997.025700.txt 

Output Set: N:\CRF4\02252004\J019151A.raw 



Invalid Line Length: 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:l; Line(s) 5 
Seq#:12; Line(s) 402,403 
Seq#:21; Line(s) 842,843 
Seq#:23; Line(s) 959 
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VERIFICATION SUMMARY DATE: 02/25/2004 

PATENT APPLICATION: US/10/019 , 151A TIME: 08:52:44 



Input Set : A:\01997.025700.txt 

Output Set: N:\CRF4\02252004\J019151A.raw 



L:44 M:332 E: 
M:332 Repeated 
L:101 M:332 E: 
L:261 M:330 E: 
L:402 M:360 E: 
L:402 M:334 W: 
L:403 M:336 W: 
L:403 M:254 E: 
L:403 M:320 E: 
M:254 Repeated 
L:404 M:252 E: 
L:491 M:336 W: 
L:494 M:336 W: 
L:497 M:336 W: 
L:500 M:336 W: 
L:503 M:336 W: 
L:506 M:336 W: 
L:509 M:336 W: 
L:512 M:336 W: 
L:515 M:336 W: 
L:518 M:336 W: 
L:521 M:336 W: 
L:524 M:336 W: 
L:527 M:336 W: 
L:571 M:336 W: 
L:574 M:336 W: 
L:577 M:336 W: 
L:580 M:336 W: 
L:583 M:336 W: 
L:586 M:336 W: 
L:589 M:336 W: 
L:592 M:336 W: 
L:595 M:336 W: 
L:598 M:336 W: 
L:601 M:336 W: 
L:604 M:336 W: 
L:607 M:336 W: 
L:610 M:336 W: 
L:614 M:336 W: 
L:842 M:360 E: 
L:843 M:360 E: 
L:842 M:334 W: 
L:843 M:336 W: 
L:843 M:334 W: 
L:844 M:336 W: 
L:844 M:254 E: 
M:254 Repeated 
L:855 M:252 E: 



/ 



[32) Invalid/Missing Amino Acid Numbering, SEQ ID:1 
in SeqNo=l y 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 2 
(2) Invalid Amino Acid Designator, NUMBER OF INVALID KEYS : 1 y 
Sequence data overflow, line data truncated, for SEQ ID#:12^ 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 19 
Invalid Amino Acid Number in Coding Region, SEQ ID: 12 
No. of Bases conflict, LENGTH : Input : 960 Counted:741 SEQ:12 
(1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 6 
in SeqNo=12 y 
No. of Seq. differs, <211> LENGTH : Input : 972 Found:753 SEQ: 12 
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Coding 


Region, 


SEQ 


ID: 


17 


Amino 


Acid 


Number 


in 
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Coding 


Region, 


SEQ 


ID: 


18 


Amino 


Acid 


Number 


in 
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18 



Sequence data overflow, line data truncated, for SEQ ID#:21 



/ 



Sequence data overflow, line data truncated, for SEQ ID#:21 

(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 19 

Invalid Amino Acid Number in Coding Region, SEQ ID: 21 

(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 20 

Invalid Amino Acid Number in Coding Region, SEQ ID: 21 

No. of Bases conflict, LENGTH : Input :14 40 Counted: 1020 SEQ:21 

in- SeqNo=21 

No. of Seq. differs, <211> LENGTH : Input : 2064 Found:1644 SEQ:21 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/019 , 151A 



DATE: 02/25/2004 
TIME: 08:52:44 



Input Set : A:\01997.025700.txt 

Output Set: N:\CRF4\02252004\J019151A.raw 



L:959 M:360 E: Sequence data overflow, line data truncated, for SEQ ID#:23 

L:959 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 19 

L:960 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 23 

L:960 M:254 E: No. of Bases conflict, LENGTH : Input : 1161 Counted:981 SEQ:23 

L:960 M:252 E: No. of Seq. differs, <211> LENGTH : Input : 1161 Found:981 SEQ:23 
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